Kafka Producer Performance Tuning


2023-08-07 00:15 | Source: compiled from the web

1. Introduction

This article is based on a LinkedIn slide deck, producer-performance-tuning-for-apache-kafka.

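2. Assumptions of this article

- The Kafka version discussed is 0.10.0.
- There is no broker-side recompression.
- Every message carries 8 extra bytes because of the newly introduced timestamp.

3. Optimization goals

Given a data set to send, optimize the following two metrics while still meeting the durability and ordering requirements:

- Throughput
- Latency

The tuning focuses on average performance, so the results apply to all producers.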

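4. Kafka producer internals recap

4.1 Key producer configurations

- batch.size: the size-based batching policy
- linger.ms: the time-based batching policy
- compression.type: in terms of compression speed, lz4 = snappy [...] 1, in which case requests queue up in the pipeline

PS: everything that follows assumes max.in.flight.requests.per.connection=1.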
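The two batching policies above can be illustrated with a minimal sketch. This is a simplified model, not the producer's actual code: it only captures the idea that a batch becomes eligible for sending once it is full (batch.size) or has waited long enough (linger.ms). The names `BatchingConfig` and `batch_is_ready` are hypothetical, and the linger value of 5 ms is an arbitrary illustration.

```python
from dataclasses import dataclass


@dataclass
class BatchingConfig:
    batch_size: int  # bytes, mirrors batch.size
    linger_ms: int   # milliseconds, mirrors linger.ms


def batch_is_ready(batch_bytes: int, waited_ms: int, cfg: BatchingConfig) -> bool:
    """Return True when either batching policy allows the batch to be sent."""
    full = batch_bytes >= cfg.batch_size   # size-based policy
    expired = waited_ms >= cfg.linger_ms   # time-based policy
    return full or expired


# batch.size=100000 matches the example command in this article;
# linger.ms=5 is an assumed value chosen only for illustration.
cfg = BatchingConfig(batch_size=100_000, linger_ms=5)
print(batch_is_ready(100_000, 0, cfg))  # True: the batch is full
print(batch_is_ready(20_000, 5, cfg))   # True: linger.ms has elapsed
print(batch_is_ready(20_000, 1, cfg))   # False: neither condition holds
```

In this model a larger batch.size or linger.ms lets more records accumulate per batch, which tends to help throughput at the cost of latency.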

5. Producer tuning

5.1 Tuning tools

Producer tuning mainly relies on kafka-producer-perf-test.sh (org.apache.kafka.tools.ProducerPerformance): run it with different configurations and compare the resulting send performance.

Example usage:

./kafka-producer-perf-test.sh --num-records 1000000 --record-size 1000 --topic becket_test_3_replicas_1_partition --throughput 1000000 --producer-props bootstrap.servers=localhost:9092 max.in.flight.requests.per.connection=1 batch.size=100000 compression.type=lz4
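Comparing configurations means running the command above several times with one property changed. The following sketch generates such a sweep over batch.size. It only builds and prints the commands; running them (for example with `subprocess.run`) is left to the reader. The helper names and the list of batch sizes are hypothetical; the flags and the other property values are copied from the example command.

```python
import shlex

# Properties held constant across runs, copied from the example command.
BASE_PROPS = {
    "bootstrap.servers": "localhost:9092",
    "max.in.flight.requests.per.connection": 1,
    "compression.type": "lz4",
}


def build_perf_command(topic, num_records, record_size, throughput, producer_props):
    """Build the kafka-producer-perf-test.sh argument list for one run."""
    cmd = [
        "./kafka-producer-perf-test.sh",
        "--num-records", str(num_records),
        "--record-size", str(record_size),
        "--topic", topic,
        "--throughput", str(throughput),
        "--producer-props",
    ]
    cmd += [f"{key}={value}" for key, value in producer_props.items()]
    return cmd


def sweep_batch_sizes(batch_sizes):
    """Return one command per batch.size value, all else held constant."""
    commands = []
    for batch_size in batch_sizes:
        props = {**BASE_PROPS, "batch.size": batch_size}
        commands.append(build_perf_command(
            topic="becket_test_3_replicas_1_partition",
            num_records=1_000_000,
            record_size=1000,
            throughput=1_000_000,
            producer_props=props,
        ))
    return commands


# The batch sizes below are assumed values chosen for illustration.
for command in sweep_batch_sizes([16384, 50000, 100000]):
    print(shlex.join(command))
```

Changing only one property per sweep keeps the runs comparable.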

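PS: Kafka 0.8 also supported options such as thread-num. They are not available in 0.10.1 yet, but an issue is already open to address this, so they should arrive soon. See KAFKA-3554 for details.

Once KAFKA-3554 is fixed, the following features will be available:

- --num-threads: the number of threads used to send messages.
- --value-bound: the range of the random integer in the messages. This option is useful when compression is used. Different integer ranges simulate different compression ratios.
- producer metrics: ProducerPerformance also prints a set of metrics when it runs.

The third item is a feature that did not exist before, and it is very important for producer tuning. The metrics printed by ProducerPerformance are:

- Select_Rate_Avg (the rate at which the sender thread runs to check whether it can send some messages)
- Request_Rate_Avg
- Request_Latency_Avg (not including the callback execution time)
- Request_Size_Avg (after compression)
- Batch_Size_Avg (after compression)
- Records_Per_Request_Avg
- Record_Queue_Time_Avg
- Compression_Rate_Avg

PS: these metrics need at least 1 minute of run time to become stable.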
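As a rough illustration of how these metrics relate to each other, the sketch below estimates throughput from three of them. It assumes that Compression_Rate_Avg is the ratio of compressed size to uncompressed size, and it ignores request header overhead, so the result is only an approximation. The function names and the sample values are hypothetical, not measurements.

```python
def estimate_uncompressed_throughput(request_rate_avg, request_size_avg,
                                     compression_rate_avg):
    """Approximate user-data bytes/sec from three of the printed metrics.

    Assumes compression_rate_avg = compressed size / uncompressed size.
    """
    compressed_bytes_per_sec = request_rate_avg * request_size_avg
    return compressed_bytes_per_sec / compression_rate_avg


def estimate_records_per_sec(request_rate_avg, records_per_request_avg):
    """Approximate records/sec as requests/sec times records per request."""
    return request_rate_avg * records_per_request_avg


# Hypothetical sample values, chosen only to make the arithmetic easy to follow.
throughput = estimate_uncompressed_throughput(
    request_rate_avg=200.0,      # requests per second
    request_size_avg=50_000.0,   # bytes per request, after compression
    compression_rate_avg=0.5,    # compressed / uncompressed
)
print(f"{throughput / 1e6:.1f} MB/s")          # 20.0 MB/s
print(estimate_records_per_sec(200.0, 100.0))  # 20000.0
```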

Example usage:

./kafka-producer-perf-test.sh --num-records 1000000 --record-size 1000 --topic becket_test_3_replicas_4_partition --throughput 100000 --num-threads 1 --value-bound 50000 --producer-props bootstrap.servers=localhost:9092 compression.type=gzip max.in.flight.requests.per.connection=1
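To see why the integer range passed to --value-bound affects the compression ratio, here is a small standalone experiment. It is only a sketch under stated assumptions: the payload layout (each integer stored as 4 big-endian bytes) is hypothetical and is not how ProducerPerformance builds its messages, and Python's `zlib` (DEFLATE, the algorithm behind gzip) stands in for the producer's gzip codec.

```python
import random
import zlib


def compression_ratio(value_bound, count=10_000, seed=0):
    """Return compressed size / raw size for random ints in [0, value_bound)."""
    rng = random.Random(seed)  # fixed seed so repeated runs are comparable
    raw = b"".join(
        rng.randrange(value_bound).to_bytes(4, "big") for _ in range(count)
    )
    return len(zlib.compress(raw)) / len(raw)


# A smaller bound means fewer distinct values and therefore more redundancy.
for bound in (10, 50_000, 2**31):
    print(f"value bound {bound:>10}: ratio {compression_ratio(bound):.2f}")
```

The smaller the bound, the more redundant the payload and the lower the ratio, which is how different bounds can emulate data sets that compress more or less well.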